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Leisingera nanliaiensis DSM 24252^ is a Gram-negative, motile, rod-shaped marine 
Aiphaproteobacterium, isolated from sandy marine sediments. Here we present the non- 
contiguous genome sequence and annotation together with a summary of the organism's pheno- 
typic features. The 4,948,550 bp long genome with its 4,832 protein-coding and 64 RNA genes 
consists of one chromosome and six extrachromosomal elements with lengths of 236 kb, 92 kb, 
61 kb, 58 kb, 56 kb, and 35 kb, respectively. The analysis of the genome showed that DSM 
24252^ possesses all genes necessary for dissimilatory nitrite reduction, and the strain was shown 
to be facultatively anaerobic, a deviation from the original description that calls for an emendation 
of the species. Also present in the genome are genes coding for a putative prophage, for gene- 
transfer agents and for the utilization of methylated amines. Phylogenetic analysis and 
intergenomic distances indicate that L nanhaiensis might not belong to the genus Leisingera. 



Introduction 

The genus Leisingera was proposed by Schaefer et 
al. in 2002 [1] and belongs to the family 
Rhodobacteraceae within the class 
Alphaproteobacteria. The genus currently consists 
of three species with validly published names, 
with Leisingera metliylolialidivorans as the type 
species. The genus was named in honor of Thomas 
Leisinger on the occasion of his retirement and for 
his contributions to our understanding of the bio- 
chemistry of bacterial methyl-halide metabolism. 
NH52FT (= DSM 24252t = LMG 24841T = ATCC 
BAA-92T) is the type strain of L. nanhaiensis and 
was isolated from marine sandy sediment taken 



from the South Chinese Sea [2]. The species name 
is referring to Nanhai, the Chinese name for the 
South China Sea. The other two Leisingera species 
were isolated from seawater and a marine 
electroactive biofilm, respectively [1,3]. All three 
Leisingera species are able to grow on methylated 
amines as the sole N source [4] and, at least for L. 
methylohalidivorans, the ability to grow on methyl 
halides as sole carbon source was described [1]. 

Here we present a summary classification and fea- 
tures of L. nanhaiensis DSM 24252T together with 
the description of the non-contiguous genomic 
sequence and annotation. 
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Classification and features 

16S rDNA analysis 

A representative genomic 16S rDNA sequence of L. 
nanhaiensis DSM 24252T was compared with the 
Greengenes database for determining the 
weighted relative frequencies of taxa and (trun- 
cated) ke3words as previously described [5]. The 
most frequently occurring genera were 
Phaeobacter (51.0%), Roseobacter (20.2%), 
Silicibacter (7.6%), Leisingera (5.5%) and Nautella 
(3.9%) (75 hits in total). Regarding the four hits to 
sequences from other species of the genus, the 
average identity within HSPs was 96.7%, whereas 
the average coverage by HSPs was 99.4%. Among 
all other species, the one yielding the highest 
score was L methylohalidivorans (NR_025637), 
which corresponded to an identity of 96.8% and 
an HSP coverage of 100.1%. [Note that the 
Greengenes database uses the INSDC (= 
EMBL/NCBI/DDBJ) annotation, which is not an 
authoritative source for nomenclature or classifi- 
cation.] The highest-scoring environmental se- 
quence was AJ296158 (Greengenes short name 
'Spain:Galicia isolate str. PP-154'), which showed 
an identity of 96.3% and an HSP coverage of 
100.0%. The most frequently occurring keywords 
within the labels of all environmental samples that 
yielded hits were 'microbi' (7.7%), 'marin' (7.3%), 
'water' (7.0%), 'coastal' (6.6%) and 'effect (6.6%) 
(168 hits in total). Environmental samples that 
yielded hits of a higher score than the highest 
scoring species were not found, indicating that 
this species is rarely detected in the environment. 

Figure 1 shows the phylogenetic neighborhood of 
L. nanhaiensis in a tree based on 16S rRNA gene 
sequences. The sequences of the two identical 16S 
rDNA copies in the genome do not differ from the 
previously published 16S rRNA gene sequence 
(FJ232451). 

Our phylogenetic analysis (Figure 1, Table 1) indi- 
cates that L. nanhaiensis is not particularly closely 
affiliated with the other Leisingera species. BLAST 
results against the NCBI database with the 1,429 
bp long 16S rRNA gene sequence showed 97% 
similarity to L. methylohalidivorans strain MB2, 
Phaeobacter gallaeciensis DSM 17395 and P. 
gallaeciensis 2.10 (see also the Greengenes analy- 
sis described above). Thus a reclassification of L. 
nanhaiensis might be appropriate, but should 
probably be postponed until more genome 
sequences from the relevant genera are available. 



as the 16S rRNA gene trees are only partially re- 
solved (Figure 1). A preliminary phylogenomic 
analysis is given below. 

Morphology and physiology 

L. nanhaiensis NH52FT was originally described as 
an aerobe [2], but as genes for the dissimilatory 
reduction of nitrite could be found in the genome 
of DSM 24252T (see "Insights into the genome") 
the organisms has the genetic potential to be a 
facultative anaerobe. Cells of strain NH52FT are 
Gram-negative, motile rods, 0.62 - 0.8 x 1.6 - 2.96 
^m in size [2]. Figure 2 shows a scanning-electron 
micrograph of L. nanhaiensis DSM 24252T. NaCl is 
essential for growth, which occurs from 0.6% to 
6.0% NaCl with an optimum between 1% and 4% 
[2]. The temperature range is 4°C - 37°C (opti- 
mum 25°C) and the pH range is 6 - 9.3 (optimum 
7 - 8.5). Growth only occurs on complex sub- 
strates such as yeast extract, tryptone and pep- 
tone from potatoes, as well as betaine and methi- 
onine [2]. The color of the colonies grown on 
complex medium (M2 agar medium) is beige. The 
type strain is susceptible to a broad spectrum of 
antibiotics listed in [2]. 

Chemotaxonomy 

The main cellular fatty acids of strain NH52FT are 
(>1% of total fatty acids) Cisii aic, an unknown fat- 
ty acid (eqivalent chain-length of 11.799), Ci6:o 2-oh, 
Cio:o 3-oH, Ci6:o, 1 1-methyl Ci8:i6)7c and Ci2:0 3-oH. The 
major polar lipids are phosphatidylglycerol, 
phosphatidylethanolamine, an unidentified phos- 
pholipid, an unidentified lipid and an aminolipid 
[2]. 

Genome sequencing and annotation 

Genome project history 

This organism was selected for sequencing on the 
basis of the DOE Joint Genome Institute Communi- 
ty Sequencing Program 2010, CSP 441: "Whole 
genome type strain sequences of the genera 
Phaeobacter and Leisingera - a monophyletic 
group of physiologically highly diverse organ- 
isms". The genome project is deposited in the Ge- 
nomes On Line Database [17] and the complete 
genome sequence is deposited in GenBank. Se- 
quencing, finishing and annotation were per- 
formed by the DOE Joint Genome Institute (JGI) 
using state-of-the-art sequencing technology [35]. 
A summary of the project information is shown in 
Table 2. 
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86/91 



67/65 



73/- 



- Phaeobacler caeruleus (AM943630) ' 

— Phaeobacter daeponensis (D0981486) ' 



77/89 



- Leisingera aquimarina (AM900415) 



76/- 



' — Leisingera methylohalidivorans (AY005463) ' 

Phaeobacter arcticus (DO514304) ' 

Phaeobacter inhibens (AY177712} ' 



• Phaeobacter gallaeciensis (Y13244) " 

Seohaeicola saemankumensis (EU221274) 



- Leisingera nanhaiensis (IMG251 2553699) ' 



78/- 



100/98 



Oceanicola nitratireducens (EU581832) 

- Oceanicola nanhaiensis (DO414420) ' 
Oceanicola pacificus (DQ659449) 



- Oceanicola batsensis (AY424898) ' 

Oceanicola granulosus (AY424896) ' 



- Oceanicola marinus (D0822569) 



Figure 1. Phylogenetic tnse highlighting the position of L. nanhaiensis relative to the type strains of the other species within the 
genus Leisingera and the neighboring genera Phaeobacter, Oceanicola, and Seohaeicola [1-3,6-16]. The tree was inferred 
from 1,385 aligned characters of the 1 6S rRNAgene sequence under the maximum likelihood (ML) criterion as previously de- 
scribed [5]. Oceanicola spp. were included in the dataset for use as outgroup taxa. The branches are scaled in terms of the 
expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap rep- 
licates (left) and from 1,000 maximum-parsimony bootstrap replicates (right) if largerthan 60% [5]. Lineages with type strain 
genome sequencing projects registensd in GOLD [17] are labeled with one asterisk, those also listed as 'Complete and 
Published' with two asterisks [10,18-23]. 
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Figure 2. Scanning electron micrograph of L. nanhaiensis DSM 24252^ 
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Table 1. Classification and general features of L. nanhaiensis DSM 24252^ according to the MIGS recom- 
mendations [24] published by the Genome Standards Consortium [251. 



MIGS ID Property 



Term 



Evidence code 



MIGS-7 
MIGS-12 



MIGS-6.1 
MIGS-6.1 
MIGS-6.3 
MIGS-22 



MIGS-6 

MIGS-6.2 

MIGS-15 

MIGS-14 

MIGS-16 

MIGS-18 

MIGS-19 

MIGS-23 

MIGS-4 

MIGS-5 

MIGS-4.1 

MIGS-4.2 

MIGS-4.3 

MIGS-4.4 



Current classification 



Subspecific genetic lineage (strain] 

Reference for biomaterial 

Gram stain 

Cell shape 

Motility 

Sporulation 

Temperature range 

Optimum temperature 

Salinity 

Relationship to oxygen 
Carbon source 
Energy metabolism 
Habitat 
pH 

Biotic relationship 
Known pathogenicity 
Specific host 
Health status of host 
Biosafety level 
Trophic level 
Isolation 

Geographic location 

Time of sample collection 

Latitude 

Longitude 

Depth 

Altitude 



Domain Bacteria 

Phylum Proteobacteria 

Class Alphaproteobacteria 

Order Rhodobacterales 

Family Rhodobacteraceae 

Genus Leisingera 

Species Leisingera nanhaiensis 

NH52FT 

Sun eta/ 2010. 

Gram-negative 

Rod-shaped 

Yes 

Not reported 
4-37 °C 
25°C 
halophile 

facultatively anaerobe 

complex substrates, betaine, methionine 

Not reported 

sea water, sediment, sand 

pH 6.0-9.3 (optimal, pH 7-8.5) 

free living 

Not reported 

Not reported 

Not reported 

1 

Not reported 
sandy sediments 
South China Sea 
before 2009 
15.55 
114.49 
157 m 

Not reported 



TAS [26] 
TAS [27,28] 

TAS [28,29] 

TAS [28,30] 

TAS [28,31] 

TAS [1,3,32] 

TAS [2] 
TAS [2] 
TAS [2] 
TAS [2] 
TAS [2] 
TAS [2] 

TAS [2] 
TAS [2] 
TAS [2] 
IDA 
TAS [2] 

TAS [2] 
TAS [2] 
TAS [2] 



TAS [33] 

TAS [2] 
TAS [2] 
NAS 
TAS [2] 
TAS [2] 
TAS [2] 



Evidence codes - TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: 
Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a 
generally accepted property for the species, or anecdotal evidence); IDA: Inferred from Direct Assay. Evi- 
dence codes are from the Gene Ontology project [34]. 
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Growth conditions and DNA isolation 

A culture of DSM 24252T was grown in DSMZ me- 
dium 514 (Bacto Marine Broth) [36] at 28°C. 
gDNA was purified using Jetflex Genomic DNA Pu- 
rification Kit (GENOMED 600100] following the 
directions provided by the supplier but modified 
by the use of a 40 min incubation time. The purity, 
quality and size of the bulk gDNA preparation 
were assessed by JGI according to DOE-JGI guide- 
lines. DNA is available through the DNA Bank 
Network [37]. 

Genome sequencing and assembly 

The draft genome sequence was generated using 
lUumina data [38]. For this genome, we construct- 
ed and sequenced an lUumina short-insert paired- 
end library with an average insert size of 270 bp 
which generated 13,912,778 reads and an 
lUumina long-insert paired-end library with an 
average insert size of 7,381 ± 2,326 bp which gen- 
erated 9,786,858 reads totaling 3,555 Mbp of data 
(Feng Chen, unpublished data). All general aspects 
of library construction and sequencing can be 
found at the JGI web site [39]. The initial draft as- 
sembly contained 43 contigs in 14 scaffolds. The 
initial draft data was assembled with AUpaths and 
the consensus was computationally shredded into 
10 kbp overlapping fake reads (shreds). The 
lUumina draft data was also assembled with Vel- 
vet [40], and the consensus sequences were com- 
putationally shredded into 1.5 kbp overlapping 
fake reads (shreds). The Hlumina draft data was 
assembled again with Velvet using the shreds 
from the first Velvet assembly to guide the next 
assembly. The consensus from the second Velvet 
assembly was shredded into 1.5 kbp overlapping 
fake reads. The fake reads from the AUpaths [41] 
assembly and both Velvet assemblies and a subset 
of the niumina CLIP paired-end reads were as- 
sembled using parallel phrap (High Performance 
Software, LLC). Possible mis-assemblies were cor- 
rected with manual editing in Consed [42-44]. Gap 
closure was accomplished using repeat resolution 
software (Wei Gu, unpublished data), and se- 
quencing of bridging PGR fragments with Sanger 
technology. One round of manual/wet lab finish- 
ing was completed. A total of 43 additional se- 
quencing reactions were completed to close gaps 
and to raise the quality of the final sequence. The 
estimated size of the genome is 5 Mb and the final 
assembly is based on 3,555 Mbp of lUumina draft 
data, which provides an average 711 x coverage of 
the genome. 



Genome annotation 

Genes were identified using Prodigal [45] as part 
of the JGI genome annotation pipeline [46], fol- 
lowed by a round of manual curation using the JGI 
GenePRIMP pipeline [47]. The predicted CDSs 
were translated and used to search the National 
Center for Biotechnology Information (NCBI) 
nonredundant database, UniProt, TIGR-Fam, Pfam, 
PRIAM, KEGG, COG, and InterPro databases. Addi- 
tional gene-prediction analysis and functional an- 
notation was performed within the Integrated Mi- 
crobial Genomes - Expert Review (IMG-ER) plat- 
form [48]. 

Genome properties 

The genome consists of seven scaffolds with a total 
length of 4,948,550 bp and a G+C content of 60.7% 
(Figures 3a, 3b, 3c, 3d, 3e, 3f and Figure 3g, Table 
3). The scaffolds correspond to a chromosome 
4,411,177 bp in length and six extrachromosomal 
elements. Of the 4,896 genes predicted, 4,832 were 
protein-coding genes and 64 RNAs. The majority of 
the protein-coding genes (81.1%) were assigned a 
putative function while the remaining ones were 
annotated as hypothetical proteins. The distribu- 
tion of genes into COGs functional categories is pre- 
sented in Table 4. 

Insights into the genome 

The replication-initiation systems identified on 
the scaffolds were as follows: cNanh_4411, dnaA, 
repB-I and rep ABC-2; pNanh_A236, repABC-5; 
pNanh_B92, dnaA-Uke and repA-d; pNanh_D58, 
repABC-9; pNanh_F35, repA-a; pNanha_E56, 
repA-b and repA-c; pNanh_C61, repA-I. This justi- 
fies the interpretation of cNanh_4411 as circular 
chromosome and of the other scaffolds as circu- 
lar extrachromosomal elements [49,50]. 

Genome analysis of L. nanhaiensis DSM 24252T 
also revealed the genes for the utilization of meth- 
ylated amines (MAs). The key genes from the pro- 
posed pathway of MA metabolism code for the 
enzymes trimethylamine monooxygenase {tmm) 
and gammaglutamylmethylamide synthetase 
[gmaS]. The trimethylamine monooxygenase is a 
flavin-dependent enzyme, recently identified by 
Chen et al. [51]. Comparison of a previously pub- 
lished sequence for a trimethylamine 
monooxygenase gene in L. nanhaiensis DSM 
24252T from Chen [4] (GenBank accession num- 
ber JN797867) showed 99% sequence similarity 
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to the gene of a predicted flavoprotein involved in 
K+ transport in the genome of DSM 24252T 
(Nanh_04177). Comparison of the ^ma5 sequence 
(JN797857) with the genome showed also a 99% 
sequence similarity to a glutamine synthetase, 



type III (IMG term: gamma-glutamylmethylamide 
synthetase, EC 6.3.4.12) (Nanh_04141). These 
genes give L. nanhaiensis the potential to utilize 
MAs as alternative nitrogen sources [4]. 



Table 2. Genome sequencing project information 



MIGS ID 


Property 


Term 


MIGS-31 


Finishing quality 


Non-contiguous finished 


MIGS-28 


Libraries used 


Two lllumina paired-end libraries (2 70 bp and 8 kb insert size) 


MlGS-29 


Sequencing platforms 


lllumina GAii 




OcUUCIICIllg CUVcFagt: 


711 "v 1 1 1 1 im 1 no 
/ 1 1 A 11 1 Ui II 1 1 Id 


MIGS-30 


Assemblers 


Allpaths version r39750, Velvet 1.1.05, phrap version SPS -4.24 


MIGS-32 


Gene calling method 


Prodigal 1.4, GenePRIMP 




INSDC ID 


AXBGOOOOOOOO 




GenBank Date of Release 


August 2 3, 2013 




GOLD ID 


Gi10857 




NCBI project ID 


78295 




Database: IMG 


2512047087 


MIGS-13 


Source material identifier 


DSM 24252 



Table 3. Genome Statistics 


Attribute 


Number % 


of Total 


Genome size (bp) 


4,948,550 


100.00 


DNA coding region (bp) 


4,430,400 


89.53 


DNA G+C content (bp) 


3,005,972 


60.74 


Number of replicons 


7 




Extrachromosomal elements 


6 




Total genes 


4,896 


100.00 


RNA genes 


64 


1.31 


rRNA operons 


2 




tRNA genes 


49 


1.00 


Protein-coding genes 


4,832 


98.69 


Pseudo genes 


0 


0.00 


Genes with function prediction 


3,970 


81.09 


Genes in paralog clusters 


3,848 


78.59 


Genes assigned to COGs 


3,813 


77.88 


Genes assigned Pfam domains 


4,051 


82.74 


Genes with signal peptides 


426 


8.70 


Genes with transmembrane helices 


1,059 


21.63 
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ZVXlCa SOCODI 210000 

Figure 3a. Graphical map of the chromosome (cNanh_441 1). From outside to the center: genes on forward 
strand (color by COG categories), genes on reverse strand (color by COG categories), RNA genes (tRNAs 
green, rRNAs red, other RNAs black), GC content, GC skew. 




Figure 3b. Graphical map of the extrachromosomal element pNanh_A236. 
From bottom to top: genes on forward strand (color by COG categories), genes 
on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs 
red, other RNAs black), GC content, GC skew. 
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Figure 3c. Graphical maps of the extrachromosomai element 
pNanh_B92. From outside to the center: genes on forward strand (color 
by COG categories), genes on reverse strand (color by COG categories), 
RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, 
GC skew. 



I 




Figure 3d. Graphical maps of the extrachromosomai element 
pNanh_D58. From outside to the center: genes on forward strand 
(color by COG categories), genes on reverse strand (color by COG 
categories), RNA genes (tRNAs green, rRNAs red, other RNAs 
black), GC content, GC skew. 
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Figure 3e. Graphical maps of the extrachromosomal element 
pNanh_F35. From outside to the center: genes on forward 
strand (color by COG categories), genes on reverse strand 
(color by COG categories), RNA genes (tRNAs green, rRNAs 
red, other RNAs black), GC content, GC skew. 



30000 




Figure 3g. Graphical maps of the extrachromosomal ele- 
ment pNanh_C61. From outside to the center genes on 
forward strand (color by COG categories), genes on re- 
verse strand (color by COG categories), RNA genes 
(tRNAs green, rRNAs red, other RNAs black), GC content, 
GC skew. 



40000 



50000 — 



Figure 3f. Graphical maps of the 
extrachromosomal element pNanh_E56. From left 
to right: genes on forward strand (color by COG 
categories), genes on reverse strand (color by COG 
categories), RNA genes (tRNAs green, rRNAs red, 
other RNAs black), GC content, GC skew. 
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Table 4. Number of genes associated with the general COG functional categories 



Code 


Value 


%age 


Description 


J 


172 


4.11 


Translation, ribosomal structure and biogenesis 


A 


1 


0.02 


RNA processing and modification 


K 


299 


7.14 


Transcription 


L 


249 


5.95 


Replication, recombination and repair 


B 


4 


0.10 


Chromatin structure and dynamics 


D 


39 


0.93 


Cell cycle control, cell division, chromosome partitioning 


Y 


0 


0 


Nuclear structure 


V 


63 


1.50 


Defense mechanisms 


T 


113 


2.70 


Signal transduction mechanisms 


M 


199 


4.75 


Cell wall/membrane/envelope biogenesis 


N 


33 


0.79 


Cell motility 


Z 


0 


0 


Cytoskeleton 


w 


0 


0 


Extracellular structures 


u 


77 


1.84 


Intracellular trafficking, secretion, and vesicular transport 


o 


147 


3.51 


Posttranslational modification, protein turnover, chaperones 


c 


288 


6.88 


Energy production and conversion 


G 


209 


4.99 


Carbohydrate transport and metabolism 


E 


523 


12.49 


Amino acid transport and metabolism 


F 


88 


2.10 


Nucleotide transport and metabolism 


H 


167 


3.99 


Coenzyme transport and metabolism 


1 


247 


5.9 


Lipid transport and metabolism 


P 


217 


5.18 


Inorganic ion transport and metabolism 


Q 


159 


3.8 


Secondary metabolites biosynthesis, transport and catabolism 


R 


509 


12.16 


General function prediction only 


S 


384 


9.17 


Function unknow/n 




1,083 


22.12 


Not in COGs 




4,187 




Total 
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Interestingly, the genes tmm and gmaS of L. 
nanhaiensis DSM 24252T do not cluster with the 
corresponding genes of the other Leisingera spe- 
cies in phylogenetic trees calculated for these 
genes. The Tmm sequence (~255 amino acids) 
clusters with Ruegeria pomeroyi and Roseobacter 
denitrificans, the sequence for GmaS (~264 amino 
acids) with Ruegeria atlantica and Roseobacter sp. 
AzwK-3b [4]. 

Strain DSM 24252T encodes a gene transfer agent 
(GTA), a virus-like particle that mediates transfer 
of genomic DNA between prokaryotes without 
negative effects on the host cell [52]. The GTA 
cluster has a length of -17 kb (Nanh_00247- 
Nanh_00229) and shows structural similarities to 
GTAs of other Rhodobacterales species, e.g. 
Phaeobacter inhibens 2.10 and P. inhibens DSM 
17395 [53]. (Note that the species affiliation of P. 
gallaeciensis and P. inhibens strains had recently 
been reassessed, resulting in the assignment of 
the alleged P. gallaeciensis type-strain deposit 
DSM 17395 to P. inhibens [54].) Strain DSM 
24252T also harbors a putative prophage 
(Nanh_4518 - Nanh_4531). 

We found all genes necessary for dissimilatoiy 
nitrite reduction, including the cluster for nitrite 
reductase (Nanh_03376 - Nanh_03386), the clus- 
ter for nitric oxide reductase [Nanh_03387 - 
Nanh_03394), and the cluster for nitrous oxide 
reductase (Nanh_01753 - Nanh_01761). L. 
nanhaiensis was described as an aerobic bacte- 
rium, but only the reduction of nitrate was tested 
in the original description of this organism [2]. 
Based on the genomic information we tested 



strain DSM 242 52t for dissimilatory reduction of 
nitrite, by using anaerobic marine basal medium 
according to Cypionka and Pfennig [55] supple- 
mented with nitrite (5mM) and methionine 
(ImM). Reduction of nitrite was tested 
photometrically at 545nm after two weeks, using 
the Griess reaction [56]. 

The results showed that strain DSM 24252T is able 
to reduce nitrite under anoxic conditions, demon- 
strating that it is a facultatively anaerobic organ- 
ism. Below we propose an according emendation 
of the species description. 

Interestingly, the same was found recently for 
Phaeobacter inhibens T5t [21], which also was ini- 
tially only tested for the reduction of nitrate and 
thus described as a strictly aerobic bacterium 
[32]. A test for nitrite reduction showed, however, 
that P. inhibens is in fact a facultatively anaerobic 
bacterium [21]. Based on these findings, we sug- 
gest that anaerobic growth of roseobacters should 
not only be tested with nitrate, but also with ni- 
trite. 

As indicated by the 16S rRNA gene sequence anal- 
ysis [Figure 1), the classification of L. nanhaiensis 
might need to be reconsidered. We conducted a 
preliminary phylogenomic analysis using GGDC 
[57-59] and the draft genomes of the type strains 
of the other Leisingera and Phaeobacter species. 
The results shown in Table 5 indicate that the 
DNA-DNA hybridization (DDH) similarities calcu- 
lated in silico of L. nanhaiensis to Phaeobacter spe- 
cies are, on average, not smaller than those to oth- 
er Leisingera species. The highest value was ob- 
tained for P. arcticus. 



Table 5. DDH similarities between L. nanhaiensis DSM 24252^ and the other Leisingera and 
Phaeobacter species (with genome-sequenced type strains)^ 



Reference species 



formula 1 formula 2 formula 3 



L aquamarina (AXBEOOOOOOOO) 

L methylohalidivorans (CP006773, CP006774, CP006775) 

P. arcticus (AXBFOOOOOOOO) 

P caeruleus (AXBIOOOOOOOO) 

P daeponensis (AXBDOOOOOOOO) 

P ga//aec;ens/s (AOQAOl 000000) 

P inhibens (AXBEOOOOOOOO) 



14.50+3.11 19.20+2.28 14.70+2.55 

14.50+3.11 19.20+2.29 14.60+2.64 

14.60+3.12 22.90+2.37 14.80+2.66 

14.50+3.11 19.40+2.29 14.60+2.65 

14.70+3.13 19.60+2.30 14.80+2.66 

13.80+3.06 20.20+2.31 14.00+2.61 

13.90+3.06 19.50+2.29 14.10+2.61 



Calculated in silico with the GGDC server version 2.0 [59]. The standard deviations indicate the inherent 
uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical 
test data sets (which are always limited in size); see [59] for details. The distance formulas are explained in 
[57]. The numbers in parentheses are GenBank accession numbers identifying the underlying genome se- 
quences. Tfie accession number of /.. nanhaiensis is AXBGOOOOOOOO. 
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Plasmids 

Genome sequencing of L. nanhaiensis DSM 24252T 
reveals the presence of six plasmids with sizes 
between 35 kb and 236 kb (Table 6]. The circular 
conformation of the chromosome and four 
extrachromosomal elements has been experimen- 
tally validated. The four larger plasmids contain 
characteristic replication modules of the RepABC-, 
DnaA-like and RepA-type comprising a replicase 
as well as the parAB partitioning operon [60]. The 
respective replicases that mediate the initiation of 
replication are designated according to the estab- 
lished plasmid classification scheme [61]. The dif- 
ferent numbering of, e.g., the replicases RepC-5 
and RepC-9 from RepABC-type plasmids corre- 
sponds to specific plasmid compatibility groups 
that are required for a stable coexistence of the 
replicons within the same cell [62]. The two small 
replicons pNanh_E56 and pNanh_F35 contain soli- 
tary RepA-IV type replicases without a partition- 
ing module. This distribution may correspond 
with a higher plasmid copy number within the 
cell, thus assuring the replicon maintenance in the 
daughter cells after cell division. The additional 
RepA-IV type replicase that is located on the 
DnaA-like I plasmid pNanh_B92 may originate 
from a fusion event of two formerly independent 
plasmids. 

The locus tags of all replicases, plasmid stability 

modules and the large virB4 and virD4 genes of 
type IV secretion systems are presented in Table 



7. The largest plasmid pNanh_A236 contains a 
post-segregational killing system (PSK), consisting 
of a typical operon with two small genes encoding 
a stable toxin and an unstable antitoxin [63]. 
Moreover, this RepABC-type plasmid also contains 
a complete type IV secretion system including the 
virB operon for the formation of a transmembrane 
channel. The relaxase VirD2, which is required for 
the strand-specific DNA nicking at the origin of 
transfer [priT), and the coupling protein VirD4 
support the presence of a functional conjugation 
system on this plasmid [64,65]. The presence of 
the highly conserved chromosomal genes virD2 
(Nanh_3787) and virD4 (Nanh_3786], represent- 
ing the relaxase and coupling protein, respective- 
ly, is noteworthy. However, the presence of two 
genes with an equivalent function on pNanh_F35 
(Nanh_0082, Nanh_0080) is indicative for the mo- 
bilization of the smallest plasmid. The RepA-I type 
replicon pNanh_C61 contains a complete 
rhamnose operon [66] and it is dominated by 
genes that are required for polysaccharide biosyn- 
thesis. Finally, the presence of CRISPRs [Clustered 
Regularly Interspaced Short Palindromic Repeats) 
that provide acquired resistance against viruses 
[67] on the 56 kb replicon pNanh_E56 is noticea- 
ble. The circularity of this scaffold has not been 
validated experimentally, but the proven circulari- 
ty of the chromosome supports the localization of 
these CRISPRs on a plasmid. 



Table 6. General genomic features of the chromosome and extrachromosomal replicons 

* 

from L. nanhaiensis strain DSM 24252 . 



Replicon 



Scaffold Replicase Length (bp) GC (%) Topology No. Genes* 



Chromosome 
pNanh_A236 

pNanh_B92 



1 DnaA 

2 RepC-5 

DnaA-like 1 
RepA-IVc 



4,411,1 77 
236,302 

92,007 



61 circular 
61 linear* 

58 circular 



4,358 
236 

101 



pNanh_C61 


4 


RepA-l 


60,519 


63 


circular 


58 


pNanh_D58 


5 


RepC-9 


57,777 


60 


circular 


57 


pNanh_E56 


6 


RepA-IVb 


55,854 


59 


linear* 


45 


pNanh_F35 


7 


RepA-IVa 


34,914 


62 


circular 


41 



*circularity not experimentally validated; 'deduced from automatic annotation. 
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Table 7. Integrated Microbial Genome (IMG) locus tags of L. nanhaiensis DSM 24252^genes^ 


Replicon 


Replication 


initiation 


Plasmid stability 


Type IV Secretion 




Replicase 


Locus tag 


Toxin Antitoxin 


VirB4 VirD4 


Chromosome 


DnaA 


Nanh_ 


.3012 




Nanh_3787* 


pNanh_A2 36 


RepC-5 


Nanh_ 


.4695 


Nanh_4700 Nanh_4699 


Nanh_4884 Nanh_4897* 


pNanh_B92 


unaA-iiKe i 
RepA-IVc* 


Nanh_ 
Nanh_ 


.0132 
_01 12 


- 


- 


pNanh_C61 


RepA-l 


Nanh_ 


_45 77 


- 


- 


pNanh_D58 


RepC-9 


Nanh_ 


_4622 






pNanh_E56 


RepA-IVb* 


Nanh_ 


.0004 






pNanh_F35 


RepA-IVa* 


Nanh_ 


.0078 




Nanh_0080* 



^Genes for the initiation of replication, toxin/antitoxin modules and two representatives of type IV secretion 
systems (T4SS) that are required for conjugation. 



*solitary replicase without partitioning module 
#presence of adjacent DNA relaxase VirD2 

Emended description of Leisingera nanhaiensis 
DSM 24252^ Sun et al. 201 0 

Leisingera nanhaiensis [nan.hai.en'sis. N.L. fern, 
adj. nanhaiensis referring to Nanhai, the Chinese 
name for the South China Sea, from where the 
type strain was isolated). 
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